Discovering regularities in Big Data is nowadays a crucial task in many different applications, from bioinformatics to cybersecurity. To this aim, a promising approach consists in performing data clustering with Local Metric Learning, i.e. trying to discover well-formed (compact and populated) clusters and, at the same time, a suitable subset of features corresponding to the subspace where each cluster lies. In this paper, we propose a novel evolutionary agent based clustering algorithm, where agents act as individuals of an evolving population, each one performing a simple cluster analysis task on a different subset of patterns randomly drawn from the entire dataset. A customized genetic algorithm orchestrates the evolution of a population of such agents. This approach is able to reveal clusters that are not evident when using the whole set of features, and can deal with large datasets, as each agent will process a small subset of patterns. First results obtained considering a synthetic dataset are encouraging, suggesting future developments.
An evolutionary agents based system for data mining and local metric learning / Giampieri, Mauro; Rizzi, Antonello. - ELETTRONICO. - (2018), pp. 1461-1466. (Intervento presentato al convegno 19th International Conference on Industrial Technology ICIT 2018 tenutosi a Lyon, France nel February 20-22 2018) [10.1109/ICIT.2018.8352396].
An evolutionary agents based system for data mining and local metric learning
Mauro Giampieri;Antonello Rizzi
2018
Abstract
Discovering regularities in Big Data is nowadays a crucial task in many different applications, from bioinformatics to cybersecurity. To this aim, a promising approach consists in performing data clustering with Local Metric Learning, i.e. trying to discover well-formed (compact and populated) clusters and, at the same time, a suitable subset of features corresponding to the subspace where each cluster lies. In this paper, we propose a novel evolutionary agent based clustering algorithm, where agents act as individuals of an evolving population, each one performing a simple cluster analysis task on a different subset of patterns randomly drawn from the entire dataset. A customized genetic algorithm orchestrates the evolution of a population of such agents. This approach is able to reveal clusters that are not evident when using the whole set of features, and can deal with large datasets, as each agent will process a small subset of patterns. First results obtained considering a synthetic dataset are encouraging, suggesting future developments.File | Dimensione | Formato | |
---|---|---|---|
Giampieri_Evolutionary_2018.pdf
solo gestori archivio
Tipologia:
Versione editoriale (versione pubblicata con il layout dell'editore)
Licenza:
Tutti i diritti riservati (All rights reserved)
Dimensione
560.54 kB
Formato
Adobe PDF
|
560.54 kB | Adobe PDF | Contatta l'autore |
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.